USAFA  TR  91-19 


AD-A245 982


OVERVIEW  OF  TWO  COMMON 
STATISTICAL  COMPUTER  PACKAGES: 

SAS  AND  BMDP 


MAJOR KAYE A. deRUIZ
DEPARTMENT  OF  MATHEMATICAL  SCIENCES 


APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED 


DEAN OF THE FACULTY
UNITED  STATES  AIR  FORCE  ACADEMY 
COLORADO  SPRINGS,  CO  80840 



USAFA-TR-91-19 


Technical  Review  by  Major  Joe  Grisham 
Department  of  Behavioral  Science  and  Leadership 
USAF Academy, Colorado 80840

Technical  Review  by  Captain  Jim  Rutledge 
Department  of  Mathematical  Sciences 
USAF  Academy,  Colorado  80840 

Editorial  Review  by  Lt  Col  Donald  C.  Anderson 
Department  of  English 
USAF  Academy,  Colorado  80840 


This research report entitled "Overview of Two Common
Statistical Computer Packages: SAS and BMDP" is presented as a
competent treatment of the subject, worthy of publication. The
United States Air Force Academy vouches for the quality of the
research, without necessarily endorsing the opinions and
conclusions of the author.

This report has been cleared for open publication and public
release by the appropriate Office of Information in accordance
with AFM 190-1, AFR 12-30, and AFR 80-3. This report may have
unlimited distribution.

ROBERT K. MORROW JR., Lt Col, USAF
Director  of  Research  Dated 


REPORT DOCUMENTATION PAGE

Form Approved
OMB No. 0704-0188

2. REPORT DATE: December 1991

4. TITLE AND SUBTITLE: Overview of Two Statistical Packages: SAS and BMDP

6. AUTHOR(S): Major Kaye A. deRuiz

7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES):
   Department of Mathematical Sciences
   United States Air Force Academy, Colorado 80840

8. PERFORMING ORGANIZATION REPORT NUMBER: USAFA TR 91-19

13. ABSTRACT (Maximum 200 words)

This report provides a brief introduction to two statistical computer
packages, SAS and BMDP. The overall approach used by these packages to input
data, perform statistical analyses, etc., is discussed. A common data set is
provided and used to give examples of programming steps and the corresponding
output. Examples of simple statistical computations, such as descriptive
statistics, linear regression, and analysis of variance, are all presented
along with a reference for more advanced techniques. This paper is not meant
to replace the associated manuals, but to provide a beginner with enough
information to read the manuals easily.


14. SUBJECT TERMS: SAS, BMDP, Linear Regression, Analysis of Variance

15. NUMBER OF PAGES: 17

17. SECURITY CLASSIFICATION OF REPORT: Unclassified

18. SECURITY CLASSIFICATION OF THIS PAGE: Unclassified

19. SECURITY CLASSIFICATION OF ABSTRACT: Unclassified

Standard Form 298 (Rev. 2-89)

OVERVIEW  OF  TWO  COMMON 
STATISTICAL  COMPUTER  PACKAGES: 
SAS  AND  BMDP 
by 

Major  Kaye  A.  deRuiz,  USAF 


Department  of  Mathematical  Sciences 
United  States  Air  Force  Academy 
Colorado  Springs,  Colorado  80840 


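TABLE OF CONTENTS

                                              Page

I.    Introduction . . . . . . . . . . . . . .   1

II.   The Data Set . . . . . . . . . . . . . .   1

III.  SAS  . . . . . . . . . . . . . . . . . .   3

IV.   BMDP . . . . . . . . . . . . . . . . . .  10

V.    Program Reference Guide  . . . . . . . .  14

VI.   References . . . . . . . . . . . . . . .  15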


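TABLE OF FIGURES

Figure                                                 Page

III-1   SAS Scatter Plot . . . . . . . . . . . . . .     4

III-2   SAS Linear Regression Model 1  . . . . . . .     5

III-3   SAS Linear Regression Model 2  . . . . . . .     6

III-4   SAS ANOVA Without Interaction  . . . . . . .     8

III-5   SAS ANOVA With Interaction . . . . . . . . .     9

IV-6    BMDP Histogram . . . . . . . . . . . . . . .    11

IV-7    BMDP Regression  . . . . . . . . . . . . . .    12

IV-8    BMDP Predicted Values and Residuals  . . . .    12

IV-9    BMDP ANOVA . . . . . . . . . . . . . . . . .    13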



I.  INTRODUCTION: 

In statistics courses at USAFA, MYSTAT is used to solve problems in linear
regression and analysis of variance because it works well, is easy to use, and
is inexpensive. Other statistical packages are more commonly used for large
data sets; they are usually more powerful and run on a mainframe computer.
In recent years, three statistical programs have become well known throughout
civilian institutions: SAS, BMDP, and SPSS. At the Air Force Academy, SPSS is
used in social science and economics classes since it is widely used in those
fields. This pamphlet introduces you to SAS and BMDP so that you can perform
simple statistical computations using these packages. This information does
not replace the manuals for these packages, but it should enable you to use
the manuals more adeptly.

II.  THE  DATA  SET: 

To  compare  the  use  of  the  two  packages,  we  will  employ  a  common  data  set 
taken  from  "Minitab  Student  Handbook"  (1976  edition). 

Students  in  an  introductory  statistics  course  participated  in  an  in-class 
experiment.  The  students  took  their  own  pulse  rate.  They  were  then  asked  to 
flip  a  coin.  If  their  coin  came  up  heads,  they  were  to  run  in  place  for  one 
minute.  Then  everyone  took  their  own  pulse  again.  The  pulse  rates  and  some 
other  data  are  given  on  the  next  page. 

III. SAS:


The SAS computer program was originally developed for a mainframe
computer. More recently, the package has been adapted to operate on personal
computers as well. A SAS program has two main parts: the data step and the
procedure steps. The data step defines a collection of data with variable
names, while the procedure (PROC) steps analyze the data. A simple SAS job
might look like the following:

DATA PULSE;
   INPUT PULSE1 PULSE2 HEIGHT WEIGHT;
   CARDS;
62 76 73.5 160
64 84 73 165
 :    (continue typing in all data to be used)
76 76 26 108

PROC PLOT DATA = PULSE;
   PLOT PULSE1 * HEIGHT;

The above steps plot the data points, with PULSE1 on the vertical axis and
the corresponding HEIGHT on the horizontal axis. PULSE was chosen as the name
of the data set; when a job contains several data sets, the DATA = option
tells the PROC precisely which set to analyze. Note that every SAS statement
(excluding the data lines) ends with a semicolon. Indentation is not required
in SAS, but it makes the DATA and PROC steps easier to find.

The CARDS statement tells SAS that the data lines follow. There are other
ways to input data; for example, large data sets may be read directly from
ASCII computer files. However, since entering the data directly in the program
is the most common method, we will not discuss alternative methods here. They
are covered in the reference manuals accompanying SAS.
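
As a rough illustration of what the INPUT and CARDS statements accomplish, the following Python sketch pairs each variable name with the corresponding value on every data line. It is not SAS and does not reproduce SAS internals; it uses only the first two data lines shown above, and the names `names`, `data_lines`, and `records` are chosen here for illustration.

```python
# Illustration only: mimic how INPUT assigns names to the values on each data line.
names = "PULSE1 PULSE2 HEIGHT WEIGHT".split()

# The first two data lines from the SAS job above.
data_lines = """62 76 73.5 160
64 84 73 165"""

records = []
for line in data_lines.splitlines():
    # Values are separated by blanks, as in the list-style INPUT statement.
    values = [float(field) for field in line.split()]
    records.append(dict(zip(names, values)))

for record in records:
    print(record)
```

Each record then holds one observation, with one entry per variable named in the INPUT statement.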

The PROC statement specifies which procedure and data set you will use.
The range of analyses that can be accomplished with SAS is extensive. PROC
statements are used to perform simple analysis of variance (ANOVA) and linear
regression as well as more complicated analyses such as factor analysis,
discriminant analysis, and stepwise regression.

Figure III-1 on the next page shows the results of the plot requested in the
above program.

Figure III-1 SAS Scatter Plot

Plot of PULSE1*HEIGHT. Legend: A = 1 obs, B = 2 obs, etc.

The SAS Statistics Manual discusses all of the procedures available and
gives many examples of how to use each type of procedure. Many of the terms
used, such as "PRESS residuals," may not be familiar to you, and there are
certain tests we cannot cover in this course. However, if you continue in
statistics or in an area that uses statistics, you will find many of the
methods quite useful.

Using the pulse data in SAS, the following steps perform linear regression.
The output is shown in Figures III-2 and III-3 on the next pages.

DATA PULSE;
   INPUT PULSE1 RAN WEIGHT ACTIVITY;
   CARDS;
(Data lines)

PROC REG DATA = PULSE;
   /* SAS labels the two models MODEL1 and MODEL2 in the output */
   MODEL PULSE1 = WEIGHT ACTIVITY;
   MODEL PULSE1 = RAN WEIGHT ACTIVITY;

The MODEL statement in PROC REG names the dependent variable to the left of
the equal sign and, to the right, the independent variables you want to
include in the model. (See Figures III-2 and III-3.)
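
To make concrete what a regression procedure estimates, here is a minimal Python sketch of ordinary least squares with a single independent variable. It is not SAS code and does not reproduce PROC REG; the data values are hypothetical and the function name `fit_line` is an assumption made for this example.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of cross products and squares about the means.
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Hypothetical data, chosen only to keep the arithmetic easy to follow.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

intercept, slope = fit_line(x, y)
residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

print("intercept =", round(intercept, 4))
print("slope     =", round(slope, 4))
print("residuals =", [round(r, 4) for r in residuals])
```

With several independent variables, as in the models above, the same criterion (minimizing the sum of squared residuals) is applied, but the solution requires matrix computations that the packages carry out for you.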


Figure III-2 SAS Linear Regression Model 1

Model: MODEL1
Dependent Variable: PULSE1

                     Analysis of Variance

                      Sum of         Mean
Source      DF       Squares       Square    F Value    Prob>F

Model        2     604.46914    302.23457      2.585    0.0811
Error       89   10407.40043    116.93708
C Total     91   11011.86957

    Root MSE     10.81375     R-square     0.0549
    Dep Mean     72.84783     Adj R-sq     0.0337
    C.V.         14.84429

                     Parameter Estimates

                  Parameter      Standard    T for H0:
Variable    DF     Estimate         Error  Parameter=0   Prob > |T|

INTERCEP     1    87.875383    7.13929274       12.309       0.0001
WEIGHT       1    -0.093625    0.04775285       -1.961       0.0531
ACTIVITY     1    -0.615189    0.52732534       -1.167       0.2465
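
The summary quantities in Figure III-2 are all derived from the sums of squares and degrees of freedom in the analysis of variance table. The Python sketch below recomputes them as a check on how the output fits together; the input numbers are copied from Figure III-2, and the variable names are chosen here for illustration.

```python
import math

# Values copied from the analysis of variance table in Figure III-2.
ss_model, df_model = 604.46914, 2
ss_error, df_error = 10407.40043, 89
dep_mean = 72.84783

ss_total = ss_model + ss_error          # C Total sum of squares
df_total = df_model + df_error          # C Total degrees of freedom

ms_model = ss_model / df_model          # Mean Square for Model
ms_error = ss_error / df_error          # Mean Square for Error
f_value = ms_model / ms_error           # F Value
r_square = ss_model / ss_total          # R-square
adj_r_square = 1 - (1 - r_square) * df_total / df_error   # Adj R-sq
root_mse = math.sqrt(ms_error)          # Root MSE
cv = 100 * root_mse / dep_mean          # C.V. (percent)

print("F Value  =", round(f_value, 3))
print("R-square =", round(r_square, 4))
print("Adj R-sq =", round(adj_r_square, 4))
print("Root MSE =", round(root_mse, 5))
print("C.V.     =", round(cv, 5))
```

The small R-square says that WEIGHT and ACTIVITY together account for only about 5 percent of the variation in PULSE1.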

Figure III-3 SAS Linear Regression Model 2

Model: MODEL2
Dependent Variable: PULSE1

                     Analysis of Variance

                      Sum of         Mean
Source      DF       Squares       Square    F Value    Prob>F

Model        3     741.07539    247.02513      2.117    0.1038
Error       88   10270.79418    116.71357
C Total     91   11011.86957

    Root MSE     10.80341     R-square     0.0673
    Dep Mean     72.84783     Adj R-sq     0.0355
    C.V.         14.83010

                     Parameter Estimates

                  Parameter      Standard    T for H0:
Variable    DF     Estimate         Error  Parameter=0   Prob > |T|

INTERCEP     1    93.926983    9.06427323                    0.0001
RAN          1    -2.590996    2.39492590                    0.2823
WEIGHT       1                 0.04890820       -2.153       0.0341
ACTIVITY     1

To perform analysis of variance (ANOVA), you use PROC ANOVA. You can do a
simple one-way ANOVA or an n-way analysis of variance for balanced data. The
program steps for the one-way ANOVA follow:

DATA PULSE;
   INPUT PULSE2 RAN SMOKES;
   CARDS;
(Data lines)

PROC ANOVA DATA = PULSE;
   CLASS RAN;
   MODEL PULSE2 = RAN;
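
The one-way ANOVA requested above compares the variation between the groups defined by the CLASS variable with the variation within those groups. The following Python sketch shows that computation on a small set of hypothetical pulse rates; it is not SAS code, and the function name `one_way_anova` and the data values are assumptions made for this example.

```python
def one_way_anova(groups):
    """Return (ss_between, ss_within, df_between, df_within, f_value)."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    # Between-group sum of squares: group means about the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: observations about their own group mean.
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_value = (ss_between / df_between) / (ss_within / df_within)
    return ss_between, ss_within, df_between, df_within, f_value

# Hypothetical second pulse rates for students who ran and who did not run.
ran = [90.0, 100.0, 110.0]
did_not_run = [70.0, 75.0, 80.0]

ss_between, ss_within, df_between, df_within, f_value = one_way_anova([ran, did_not_run])
print("Between: SS =", ss_between, "DF =", df_between)
print("Within:  SS =", ss_within, "DF =", df_within)
print("F Value =", f_value)
```

A large F value indicates that the group means differ by more than the within-group scatter would explain.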

For an ANOVA involving more than one factor, list all classification factors
in the CLASS statement. You then use the MODEL statement to list the terms you
want in the model. Interaction terms are given by listing the variables with
an * between them (e.g., SMOKES*RAN).

Steps for two-way ANOVA with and without interaction terms are given below,
with Figures III-4 and III-5 on the following pages containing the results of
these examples.


EXAMPLES:

I. Two-way ANOVA without interaction:

DATA PULSE;
   INPUT PULSE1 PULSE2 RAN SMOKES;
   CARDS;
(Data lines)

PROC ANOVA DATA = PULSE;
   CLASS RAN SMOKES;
   MODEL PULSE2 = RAN SMOKES;

II. Two-way ANOVA with interaction:

DATA PULSE;
   INPUT PULSE1 PULSE2 RAN SMOKES;
   CARDS;
(Data lines)

PROC ANOVA DATA = PULSE;
   CLASS SMOKES RAN;
   MODEL PULSE2 = SMOKES RAN SMOKES*RAN;


                Analysis of Variance Procedure
                   Class Level Information

                Class     Levels    Values
                RAN            2    1 2
                SMOKES         2    1 2

            Number of observations in data set = 92

                Analysis of Variance Procedure

Dependent Variable: PULSE2

                            Sum of           Mean
Source             DF      Squares         Square    F Value    Pr > F

Model               2    8759.67437    4379.83718      22.06    0.0001
Error              89   17673.93433     198.58353
Corrected Total    91   26433.60870

         R-Square         C.V.      Root MSE     PULSE2 Mean
         0.331384     17.62933       14.0920       79.934783

Source             DF      Anova SS    Mean Square    F Value    Pr > F

RAN                 1    8697.40719     8697.40719      43.80    0.0001
SMOKES              1      62.26718       62.26718       0.31    0.5769

Figure III-4 SAS ANOVA Without Interaction


                Analysis of Variance Procedure
                   Class Level Information

                Class     Levels    Values
                RAN            2    1 2
                SMOKES         2    1 2

            Number of observations in data set = 92

                Analysis of Variance Procedure

Dependent Variable: PULSE2

                            Sum of           Mean
Source             DF      Squares         Square    F Value    Pr > F

Model               3    8871.71262    2957.23754      14.82    0.0001
Error              88   17561.89608     199.56700
Corrected Total    91   26433.60870

         R-Square         C.V.      Root MSE     PULSE2 Mean
         0.335622     17.67293       14.1268       79.934783

Source             DF      Anova SS    Mean Square    F Value    Pr > F

RAN                 1    8697.40719     8697.40719      43.58    0.0001
SMOKES              1      62.26718       62.26718       0.31    0.5779
RAN*SMOKES          1     112.03825      112.03825       0.56    0.4557

Figure III-5 SAS ANOVA With Interaction
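
In Figure III-5 each term is tested against the error mean square, and with balanced data the sums of squares for the individual terms add up to the model sum of squares. The Python sketch below checks both relationships using numbers copied from the figure; the variable names are chosen here for illustration.

```python
# Values copied from Figure III-5 (each term has 1 degree of freedom).
ms_error = 199.56700
anova_ss = {
    "RAN": 8697.40719,
    "SMOKES": 62.26718,
    "RAN*SMOKES": 112.03825,
}

# The term sums of squares add up to the Model sum of squares.
model_ss = sum(anova_ss.values())
print("Model SS =", round(model_ss, 5))

# F Value for a term = (term SS / term DF) / error mean square.
f_values = {term: (ss / 1) / ms_error for term, ss in anova_ss.items()}
for term, f in f_values.items():
    print(term, "F Value =", round(f, 2))
```

The interaction term has a small F value (0.56), so these data give little evidence that the effect of running on PULSE2 depends on smoking status.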

IV. BMDP:

The BMDP statistical computer package is popular in the social sciences and
education. It can be quite easy and convenient to use for complex statistical
analyses. BMDP differs from MYSTAT, SAS, and SPSS in that it is a collection
of programs, each of which performs a different kind of analysis. SAS and
MYSTAT, as you have seen, are single programs that perform many kinds of
analyses. When using BMDP, you must know which program you want to use.

A BMDP program consists of paragraphs. Sentences in the program end with a
period, while paragraphs begin with a slash. The paragraphs that describe the
data are common to all of the BMDP programs, but the paragraphs that describe
the analysis to be done are different for each program. A simple BMDP program
using P5D follows:

/INPUT     VARIABLES = 4.
           FORMAT = FREE.
/VARIABLE  NAMES = PULSE1, PULSE2, HEIGHT, WEIGHT.
/PLOT      TYPE = HIST.
           VARIABLE = PULSE1.
/END
(The data is entered here.)

This program has four paragraphs. The INPUT paragraph contains VARIABLES
and FORMAT sentences that state how many variables will be read and in what
format they will be read. The NAMES sentence in the VARIABLE paragraph assigns
names to the variables. The PLOT paragraph describes the analysis we want
performed and then identifies the variable to use in the analysis. The last
paragraph signifies the end of the program, which is then followed by the data
lines. See Figure IV-6 on the next two pages for the results of the BMDP
analysis. BMDP output always includes the program statements prior to any
analysis, while SAS output does not provide this without additional commands.
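
The paragraph and sentence structure just described can be pictured with a short Python sketch that splits the program text at each slash and then at each period. This is only an illustration of the layout, not a BMDP parser: it assumes a program with no decimal points or slashes inside sentences, and the names `program` and `split_paragraphs` are hypothetical.

```python
# The four-paragraph histogram program, as plain text.
program = """/INPUT VARIABLES = 4.
FORMAT = FREE.
/VARIABLE NAMES = PULSE1, PULSE2, HEIGHT, WEIGHT.
/PLOT TYPE = HIST.
VARIABLE = PULSE1.
/END"""

def split_paragraphs(text):
    """Map each paragraph name to the list of sentences it contains."""
    paragraphs = {}
    for chunk in text.split("/"):          # paragraphs begin with a slash
        parts = chunk.split(None, 1)       # first word is the paragraph name
        if not parts:
            continue
        name = parts[0]
        body = parts[1] if len(parts) > 1 else ""
        # Sentences end with a period; collapse line breaks and extra blanks.
        sentences = [" ".join(s.split()) for s in body.split(".") if s.strip()]
        paragraphs[name] = sentences
    return paragraphs

paragraphs = split_paragraphs(program)
for name, sentences in paragraphs.items():
    print(name, sentences)
```

The sketch reports four paragraphs (INPUT, VARIABLE, PLOT, and END), with two sentences in each of the INPUT and PLOT paragraphs.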

Regression programs in BMDP all end with the letter R; the program to do
standard multiple regression is BMDP1R. The following is an example of a
BMDP1R program. (See Figure IV-7 for the output.)

/INPUT     VARIABLES = 3.
           FORMAT = FREE.
/VARIABLE  NAMES = SEX, HEIGHT, WEIGHT.
/REGRESS   DEPENDENT = WEIGHT.
           INDEPENDENT = HEIGHT, SEX.
/END
(Data lines)


BMDP5D - HISTOGRAMS AND UNIVARIATE PLOTS
BMDP STATISTICAL SOFTWARE, INC.

PROGRAM REVISED OCTOBER 1983
MANUAL REVISED -- 1983
COPYRIGHT (C) 1983 REGENTS OF UNIVERSITY OF CALIFORNIA

PROGRAM CONTROL INFORMATION

/INPUT     VARIABLES=4.
           FORMAT=FREE.
/VARIABLE  NAMES=PULSE1,PULSE2,HEIGHT,WEIGHT.
/PLOT      TYPE=HIST.
           VARIABLE=PULSE1.
/END

NUMBER OF VARIABLES TO READ IN . . . . . . . . . . . .       4
NUMBER OF VARIABLES ADDED BY TRANSFORMATIONS . . . . .       0
TOTAL NUMBER OF VARIABLES  . . . . . . . . . . . . . .       4
NUMBER OF CASES TO READ IN . . . . . . . . . . . . . .  TO END
MISSING VALUES CHECKED BEFORE OR AFTER TRANS.  . . . . NEITHER
BLANKS ARE . . . . . . . . . . . . . . . . . . . . . . MISSING
GROUPING VARIABLE  . . . . . . . . . . . . . . . . (NONE GIVEN)

VARIABLES TO BE USED
     1 PULSE1     2 PULSE2     3 HEIGHT     4 WEIGHT

INPUT FORMAT IS
FREE

MAXIMUM LENGTH DATA RECORD IS 80 CHARACTERS.

HISTOGRAM OF VARIABLE 1 PULSE1

     SYMBOL    COUNT      MEAN    ST.DEV.
        1         12    72.750     10.101

EACH SYMBOL REPRESENTS 1 OBSERVATION

[Histogram of PULSE1 with frequency, cumulative frequency, and percentage
columns for each interval; the plot is not reproduced here.]

Figure IV-6 BMDP Histogram


In addition, BMDP1R will print the predicted values and residuals, along
with the variables in the data set. To do this, insert the paragraph
/PRINT DATA. between the REGRESS and END paragraphs. The output is shown below
in Figure IV-8.


BMDP1R

DEPENDENT VARIABLE . . . . . . . . . . . .   3 WEIGHT
TOLERANCE  . . . . . . . . . . . . . . . .   0.0100
ALL DATA CONSIDERED AS A SINGLE GROUP

MULTIPLE R           0.9571     STD. ERROR OF EST.     8.1313
MULTIPLE R-SQUARE    0.9160

ANALYSIS OF VARIANCE

               SUM OF SQUARES    DF    MEAN SQUARE    F RATIO    P(TAIL)
REGRESSION          6485.8437     2      3242.9219     49.048     0.0000
RESIDUAL             595.0596     9        66.1177

                                        STD. REG
VARIABLE     COEFFICIENT   STD. ERROR      COEFF         T   P(2 TAIL)   TOLERANCE

INTERCEPT       -41.1137
SEX             -24.09193     7.88865     -0.489    -3.054      0.0137     0.36427
HEIGHT            3.18490     0.97951      0.521     3.251      0.0100     0.36427

NUMBER OF INTEGER WORDS OF STORAGE USED IN PRECEDING PROBLEM    614
CPU TIME USED    0.140 SECONDS

Figure IV-7 BMDP Regression


LIST OF PREDICTED VALUES, RESIDUALS, AND VARIABLES

NOTE - NEGATIVE CASE NUMBER DENOTES A CASE WITH MISSING VALUES.
THE NUMBER OF STANDARD DEVIATIONS FROM THE MEAN IS DENOTED BY UP TO 3
ASTERISKS TO THE RIGHT OF EACH RESIDUAL OR VARIABLE.
MISSING VALUES AND VALUES OUT OF RANGE ARE DENOTED BY VALUES GREATER THAN OR
EQUAL TO 0.2127E+38 IN ABSOLUTE VALUE.

 CASE                  PREDICTED     VARIABLES
  NO.     RESIDUAL       VALUE       1 SEX       2 HEIGHT     3 WEIGHT

   1      -8.977  *      168.9       1.000       73.50  *     160.0
   2      -2.285         167.3       1.000       73.00  *     165.0
   3      -9.100  *      164.1       1.000       72.00        155.0
   4      -4.731         157.7       1.000       70.00        153.0
   5       1.451         110.5       2.000  *    62.75  *     112.0  *
   6      -2.289         127.3       2.000       68.00        125.0
   7      19.53   **     170.5       1.000       74.00  *     190.0  *
   8       1.824         148.2       1.000       67.00        150.0
   9       3.639         151.4       1.000       68.00        155.0
  10       0.6929        119.3       2.000  *    65.50        120.0
  11       0.2853        117.7       2.000  *    65.00        118.0
  12      -0.1602        108.2       2.000  *    62.00        108.0  *

Figure IV-8 BMDP Predicted Values and Residuals

To perform a one-way analysis of variance, program BMDP1V is used. This
program requires an additional DESIGN paragraph with a DEPENDENT sentence. An
additional GROUPING sentence, placed in the VARIABLE paragraph, identifies the
classification variable. The BMDP1V program and its output (Figure IV-9) are
shown below.


/INPUT     VARIABLES = 3.
           FORMAT = FREE.
/VARIABLE  NAMES = PULSE2, RAN, SMOKES.
           GROUPING = RAN.
/DESIGN    DEPENDENT = PULSE2.
/END
(Data lines)


* * *  ANALYSIS OF VARIANCE  * * *

          PULSE2
     BY   RAN

                            SUM OF                  MEAN               SIGNIF
SOURCE OF VARIATION        SQUARES      DF        SQUARE         F      OF F

MAIN EFFECTS              1587.000       1      1587.000     8.263     0.017
   RAN                    1587.000       1      1587.000     8.263     0.017

EXPLAINED                 1587.000       1      1587.000     8.263     0.017

RESIDUAL                  1920.667      10       192.067

TOTAL                     3507.667      11       318.879

Figure IV-9 BMDP ANOVA


The program BMDP2V can be used for designs with several factors, including
three-way and higher interactions. You can include or exclude interactions and
main effects in the DESIGN paragraph as shown below. The numbers in these
sentences indicate the order in which the factors were listed in the GROUPING
sentence. For example, RAN corresponds to 1, SMOKES corresponds to 2, SEX
corresponds to 3, and 23 represents the interaction between SMOKES and SEX.

/DESIGN    DEPENDENT = PULSE2.
           GROUPING = RAN, SMOKES, SEX.
           EXCLUDE = 23, 123.

(Excludes the interaction between SMOKES and SEX and the interaction among
all three independent variables.)

/DESIGN    DEPENDENT = PULSE2.
           GROUPING = RAN, SMOKES, SEX.
           INCLUDE = 1, 2, 3, 12, 13.

(Includes all three independent variables and the interactions between RAN
and SMOKES and between RAN and SEX.)

(Data lines)
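
The numeric codes used in the INCLUDE and EXCLUDE sentences can be decoded mechanically from the order of the factors in the GROUPING sentence. The Python sketch below illustrates that rule; it is not part of BMDP, the function name `decode_term` is hypothetical, and it assumes fewer than ten grouping factors so that each digit identifies one factor.

```python
def decode_term(code, factors):
    """Translate a term code such as '23' into the factor names it stands for."""
    # Each digit is the 1-based position of a factor in the GROUPING sentence.
    names = [factors[int(digit) - 1] for digit in code]
    return "*".join(names)

grouping = ["RAN", "SMOKES", "SEX"]   # order taken from the GROUPING sentence

for code in ["1", "2", "3", "12", "13", "23", "123"]:
    print(code, "->", decode_term(code, grouping))
```

For the grouping order above, code 23 decodes to SMOKES*SEX and code 123 to the three-way interaction RAN*SMOKES*SEX, matching the explanations given with the two DESIGN paragraphs.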


V.  PROGRAM  REFERENCE  GUIDE: 


Following is a partial list of capabilities and the SAS procedure or BMDP
program name used to access each one.

CAPABILITY                       SAS                          BMDP

Histograms                       CHART                        5D
Plots                            PLOT                         6D
Data Screening                                                4D, AM
Descriptive Statistics           TTEST, MEANS, UNIVARIATE,    1D, 2D, 3D,
                                 SUMMARY, CORR                8D, 9D

Regression
   Multiple Linear               REG                          1R
   Stepwise                      STEPWISE                     2R
   All possible subsets          RSQUARE                      9R
   Nonlinear                     NLIN                         3R, AR
   Logistic                      LOGISTIC, CATMOD             LR
   Other                         RSREG                        4R, 5R, 6R

Analysis of Variance
   Balanced                      ANOVA                        1V, 2V, 8V
   Unbalanced                    GLM                          1V, 2V, 3V
   ANCOVA                        GLM                          1V
   Repeated Measures             GLM                          2V
   Other                         NESTED, VARCOMP

Models for Categorical Data
   Contingency Tables            FUNCAT
   Log-linear Models             CATMOD                       4F

Nonparametrics                   NPAR1WAY

Multivariate Techniques
   Canonical Correlation         CANCORR                      6M
   Discriminant Analysis         DISCRIM, NEIGHBOR,           7M
                                 CANDISC, STEPDISC
   Principal Components          PRINCOMP
   Factor Analysis               FACTOR                       4M, 8M
   Cluster Analysis              CLUSTER, FASTCLUS,           1M, 2M, 3M, KM
                                 VARCLUS, TREE
   MANOVA                        GLM                          4V